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1-INTRODUCTTON: 

Perhaps  the  most  innovative  technical  development  over  the  last  ten  years 
in  the  field  of  control  has  been  the  introduction  of  artificial  neural  networks 
(ANN)  methods  for  identification,  modeling  and  control  [1,2].  The  basic 
concept  of  artificial  neural  nets  stems  fi-om  the  idea  of  modeling  individual  brain 
cells/neurons  in  a  fairly  simple  way,  and  then  connecting  these  models  in  a 
highly  parallel  fashion  to  offer  a  complex  processing  mechanism  which  exhibits 
learning  in  terms  of  its  overall  nonlinear  characteristics.  Actual  brain  cells  are 
not  of  one  particular  form.  They  are  not  all  identical,  whereas  at  present  artificial 
neural  networks  tend  to  consist  of  one  type  of  neuron.  Further,  the  overall  make 
up  of  a  brain,  in  terms  of  coimectivity  and  structure,  is  highly  complex  and  not 
well  understood,  whereas  artificial  neural  networks  are  generally  well  structmed 
and  simply  coupled,  thereby  enabling  the  possibility  of  imderstanding  their 
mode  of  operation. 

In  terms  of  a  control  systems  environment,  the  majority  of  practical 
controllers  actually  in  use  are  both  simple  and  linear,  and  are  directed  towards 
the  control  of  a  plant  which  is  either  reasonably  linear  or  at  least  linearisable. 


Neural  Networks  for  Robot  Control  -  Final  Report  -  Prof.  Chaiban  NASR 


Page  1/15 


However,  it  is  sufficient  to  state  that  a  neural  network  is  usually  a  complex 
nonlinear  mapping  tool,  and  the  use  of  such  a  device  on  relatively  simple  linear 
problems  makes  very  little  sense  at  all,  being  a  case  of  over-skill  [3].  The  fact 
that  neural  networks  have  the  capability  of  dealing  with  complex  non-linearity 
in  a  fairly  general  way  is  an  exciting  feature.  By  their  nature,  nonlinear  systems 
are  non-uniform  and  invariably  require  custom  designed  control  schemes  to  deal 
with  individual  characteristics.  No  general  theory  deals  comprehensively  with 
the  wide  range  of  nonlinear  systems  encoimtered,  so  an  approach,  namely, 
neural  networks  that  can  offer  a  fairly  broad  coverage  are  therefore  extremely 
attractive.  By  viewing  neural  networks  with  control  applications  clearly  in  mind, 
a  number  of  observations  can  be  made  [4],  the  first  of  these  being  perhaps  the 
most  significant: 

(i)  Neural  networks  can  flexibly  and  arbitrarily  map  nonlinear  functions. 
Such  networks  are  best  suited  for  the  control  of  nonlinear  systems. 

(ii)  Neural  networks  are  particularly  well  suited  to  multivariable  applications 
due  to  their  ability  to  map  interactions  and  cross-couplings  readily  whilst 
incorporating  many  inputs  and  outputs. 

(iii)  Networks  can  either  be  trained  off-line  and  subsequently  employed  either 
on  or  off-line,  or  they  can  be  trained  on-line  as  part  of  an  adaptive  control 
scheme  or  simply  a  real-time  system  identifier.  Understanding  of  a 
network’s  mode  of  operation  within  an  adaptive  controller  is,  at  the 
present  time,  extremely  limited. 

(iv)  Neural  networks  are  inherently  parallel  processing  devices  which  exhibit 
to  an  extent,  at  least,  fault  tolerant  characteristics. 

The  goal  of  the  present  work  is  a  simulating  investigation  of  the  use  of 
artificial  neural  networks  in  the  control  of  robot  manipulators  and  the 
application  of  this  knowledge  for  the  search  of  optimum  architectures  of  the 
ANN. 


2 -2D  PLANAR  ARM; 

2-1  -  Position  of  the  problem; 
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Our  focus  in  this  section  is  to  provide  the  design  fundamentals  for 
artificial  neural  networks  used  in  robotics  systems.  Trajectory  control  of  robotic 
manipulators  traditionally  consists  of  following  a  preprogrammed  sequence  of 
end  effector  movements.  Robot  control  usually  requires  control  signals  applied 
at  the  joints  of  the  robot  while  the  desired  trajectory,  or  sequence  of  arm  end 
positions,  is  specified  for  the  end  effector. 

Figure  1  shows  the  geometry  of  an  idealized  planar  robot  manipulator 
with  two  degrees  of  freedom.  The  robot  arm  operates  in  a  plane  (2D).  To  make 
the  arm  move,  the  desired  coordinates  of  the  end  effector  point  (x;  y)  are  fed  to 
the  artificial  neural  network  so  that  it  generates  the  joint  angles  (0„  Gj)  for  the 
motors  that  move  the  arms.  To  perform  end  effector  position-control  of  a  robotic 
manipulator,  two  problems  need  to  be  solved: 

(i)  Inverse  kinematics  problem:  given  the  Cartesian  coordinates  of  the  end 
effector,  specified  either  as  a  single  point  or  as  a  set  of  points  on  a 
trajectory,  joint  angles  or  a  set  of  joint  angles  need  to  be  foimd. 

(ii)  Target  position  control:  Given  the  final  end  effector  position,  a  joint- 
angles  sequence  suitable  for  achieving  the  final  position  needs  to  be 
found. 


The  forward  and  inverse  kinematics  problems  can  be  formulated 
respectively  as: 


x  =  h(0y 

(1) 

0  =  h-*(x) 


In  those  expressions  0  and  x  are  joint  angles  and  end  effector  Cartesian 
coordinate  vectors,  respectively,  which  are  defined  as  follows: 

0  =  [0,  0,]^ 

X  =  [x,  Xj]^  X,  =  X  and  Xj  =  y 

Even  though  the  numerical  solution  of  (1)  can  be  found,  this  requires 
large,  real-time  computational  resomces.  The  neural  network  approach  may  be 
used  here  to  solve  both  the  problems  of  kinematics  task  formulation  and  the 
setting  up  of  the  equations.  It  also  allows  for  circumvention  of  the 
computational  complexity  involved  in  their  numerical  solution. 

2-2  -  Multi-laver  perceptrons  method  fMLPI: 

Multi-layer  perceptrons  [5,  6]  provide  one  arrangement  for  neural  network 
implementation,  by  means  of  nonlinear  relationships  between,  firstly,  the 
network  inputs  to  outputs  and,  secondly,  the  network  parameters  to  outputs. 
Such  a  network  consists  of  a  number  of  neuron  layers,  n,  linking  its  input  vector, 
u,  to  its  output  vector,  y,  by  means  of  the  equation: 

y  =  9ii(W„(p„.,(W„., . .  .(piCWjU  +  b,)  +. . .+  b„.|)  +  bj 
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In  which  Wj  is  the  weight  matrix  associated  with  the  layer,  (pj  is  a 
nonlinear  operator  associated  with  the  ith  layer  and  b;  indicate  threshold  or  bias 
values  associated  with  each  node  in  the  ith  layer.  The  function  (pj  is  a  sigmoid 
function  for  all  n. 

It  is  known  in  reality  that  real  neurons,  located  in  different  areas  of  the 
nervous  system,  have  different  modes  of  behavior  [7]  ranging  from  Gaussian- 
like  for  visual  needs  to  sigmoid  for  ocular  motor  needs.  It  is  generally  the  case 
for  artificial  neural  networks,  however,  that  only  one  type  of  non-linearity  is 
employed  for  a  particular  network,  this  linking  in  closely  with  the  fact  that  each 
network  is  only  employed  for  one  particular  task. 


Fig.2;  -  A  two-layer  multi-layer  perceptron. 


Figure  2  shows  a  fully  connected  network  in  that  all  of  the  neuron  outputs 
in  one  layer  of  the  network  are  connected  as  inputs  to  the  next  layer.  This  is 
normal  practice:  however,  it  is  quite  possible  for  part-connectivity  to  be  realized 
by  connecting  a  group  of  outputs  to  only  specific  input.  By  this  means  sub¬ 
models  can  be  formed  within  the  overall  MLP,  and  these  can  be  particularly 
useful  where  a  specific  system  characteristic  is  to  be  dealt  with  or  inputs/outputs 
from  the  system  can  be  categorized  into  certain  types. 

Whatever  the  network  connectivity,  key  questions  in  the  use  of  MLP  are 
how  many  layers  there  should  be  and  how  many  neurons  there  should  be  in  each 
layer.  Once  these  structural  features  of  the  network  have  been  selected,  it 
remains  for  the  adjustable  weights  to  be  settled  on  such  that  the  network  is 
completely  specified  in  terms  of  its  functionality. 


2-3  -  Kinematics  back  oroDagation  learnin 


By  far  the  most  popular  method  employed  for  weight  training  in  MLP 
neural  networks  is  called  back  propagation  [8].  In  the  standard  feed- forward 
MLP  network,  back  propagation  solves  the  problem  of  missing  information  to 
the  hidden  layers,  neither  the  input  to  nor  the  reference  signals  for  the  hidden 
layers  are  known.  This  solving  problem  is  obtained  by  taking  the  inputs  to  the 
hidden  layers  as  being  the  inputs  to  the  first  layers  propagated  through  the 
network.  The  reference  signals  for  the  hidden  layers  are  then  obtained  by  error 
back  propagation  through  the  network.  This  is  realized  by  obtaining  the  partial 
derivative  of  the  squared  error  with  respect  to  the  parameters. 
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It  is  worth  pointing  out  that  the  back-propagation  algorithm  has  also  been 
used  for  weight  learning  in  feedback  neural  networks  [9,  10],  these  being 
networks  in  which  the  network  structure  incorporates  feedback,  whereby  the 
output  of  every  neuron  is  fed  back,  in  weighted  form,  to  the  input  of  every 
neuron.  The  architecture  of  such  a  network  is  inherently  dynamic  and  realizes 
powerful  capabilities  due  to  its  complexity. 

Consider,  as  a  starting  point,  a  single  neuron  with  output  yj;  then 


Vi 


=  - - z - r 

l  +  exp(-Xi) 


In  which: 

m 

i=l 

In  this  expression  Wq  is  a  bias  term.  If  it  is  assumed  that  at  an  instant  in 
time,  for  an  input  Uj  the  output  y;  should  be  equal  to  the  desired  output  y<j,  then 
the  squared  error  of  the  output  signal  is  given  by: 

Ei  =^(yd-yi)"  =^ef 

And  it  is  desired  to  minimize  Ej  by  means  of  a  suitable/best  choice  of  the 
weighting  coefficients  Wy . 

Consider  the  problem  of  minimizing  the  scalar  error  function  E  (W), 
where  W  is  a  vector  of  weights  to  be  adjusted  by  means  of  an  interactive 
procedure  generating  a  number  of  searching  points,  W(k),  such  that: 

W(k+1)  =  a(k).W(k)  +  Ti(k).d(k) 

In  this  relation  an  initial  set  of  weightings  W(0)  is  made  through  prior 
knowledge,  by  a  reasoned  guess  or  even  relatively  randomly.  The  term 
a(k).W(k)  represent  a  momentum  term  and  a(k)  is  usually  a  positive  number 
called  the  momentum  constant.  The  term  d(k)  indicates  the  search  direction, 
whereas  r\(k)  indicates  the  length  of  search  step  or  the  amoimt  of  learning  to  be 
carried  out. 

In  this  way,  the  weights  associated  with  one  neuron  can  be  adjusted  in 
order  to  minimize  the  squared  error.  The  approach  can  then  be  extended  in  order 
to  adjust  all  of  the  weights  in  the  MLP  network.  So,  overall,  a  set  of  inputs  and 
desired  output  data  values  is  used  to  train  the  entire  network.  The  input  set  also 
realizing  a  corresponding  set  of  network  weights  such  that  the  error  between  the 
desired  output  signals  and  the  actual  network  output  signals  is  minimized  in 
terms  of  the  average  overall  learning  points.  The  back  propagation  algorithm 
employs  the  steepest-descent  method  to  arrive  at  a  minimum  of  the  mean 
squared  error  function.  For  one  specific  data  pair,  the  error  squared  can  be 
written  as: 
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1  m  1  m 

Ek  =-£(ydk-yik)' 

^  i=l  ^  i=l 

Where  m  neurons  are  assumed  to  be  present  in  the  architecture  of  the 
neural  network,  and  yj^  is  the  ith  neuron’s  kth  output  value. 

The  global  error  is  then  found  by  minimizing  over  all  the  data  set.  If  the 
niunber  of  data  values  that  are  present  is  N,  then  we  have: 

N 

e=2;e^ 

k=l 

This  error  function  can  then  be  minimized  in  batch  mode  or  recursively  in 
an  on-line  manner. 

The  network  is  now  fully  trained  on  the  data  presented  and  can  be 
employed  with  any  further  data,  although  it  may  be  desirable  to  present  the  data 
again  cyclically  imtil  the  overall  error  falls  below  a  previously  defined  minimum 
value,  i.e.  until  the  weights  converge.  An  important  feature  then  emerges  in  that 
the  MLP  network  has  the  ability  to  generalize  when  it  is  presented  with  new 
data  not  previously  dealt  with. 

2-4  -  Numerical  simulations: 

The  first  numerical  simulation  aims  to  study  the  application  of  neural 
networks  to  an  arc  of  circle,  then  to  the  entire  circle  and  finally  to  the  entire 
disk.  The  robot  is  required  to  learn  Cartesian  Coordinates  (x;y)  for  the  circular 
trajectory  of  radius  r.  A  network  with  two  inputs  (0,  and  Gj),  a  variable  number 
of  hidden  nodes,  and  two  output  neurons  yielding  outputs  (x;  y)  has  been 
selected  to  simulate  the  arc  of  a  circle,  all  the  circle  and  the  entire  disk. 

The  constraint  on  the  circular  trajectory,  which  needs  to  be  learned,  is: 

X  =  r  cos(P) 
y  =  rsin(P)  (2) 

In  this  expression  the  angle  P  is  covering  the  first  quadrant.  The  training 
has  been  performed  for  1,  =  3,  Ij  =2  and  r  =  5  at  10  points  of  selected  trajectory. 
Different  networks  has  been  trained  using  the  error  back-propagation  technique 
with  learning  constant  q  =  0.1  and  a  =  0.5  for  the  momentum  constant.  The 
training  data  of  the  trajectory  with  r  =  5  contains  angle  p  between  5°  and  85°, 
spaced  by  10°,  and  p  =  90°. 

The  results  of  training  are  shown  in  Figure  5.  The  2-8-2  multi-layer 
perceptron  represent  the  optimal  network.  In  this  case,  a  minimum  error  of  10^ 
is  obtained  after  250  cycles. 

In  the  case  of  a  circular  trajectory,  the  constraint  that  must  be  learned,  is 
given  by  equations  (2)  where  angle  p  is  covering  the  four  quadrants.  The 
training  has  been  performed  for  Ij  =  3,  Ij  =2  and  r  =  5  at  40  points  of  the  entire 
trajectory.  Different  networks  has  been  trained  using  the  error  back-propagation 
technique  with  learning  constant  q  =  0.1  and  a  =  0.1  for  the  momentum 
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constant.  The  results  of  training  are  shown  in  Figure  6.  The  2-8-8-2  multi-layer 
perceptron  represent  the  optimal  network.  In  this  case  a  miniTmim  error  of  0.02 
is  obtained  after  100  cycles. 

Finally,  in  the  case  of  a  disk,  the  training  has  been  performed  for  1,  =  3, 
I2— 2  and  r  =  5  at  180  points  of  entire  disk.  Different  networks  has  been  trained 
using  the  error  back-propagation  technique  with  learning  constant  t]  =  0.1  and 
for  the  momentum  constant.  The  training  data  of  the  trajectory  with  a 
radius  r  =1:2:3 :4  and  5  contains  angle  P  between  5°  and  355°,  spaced  by  10°,  and 
P=360°. 

The  results  of  training  are  shown  in  Figure  7.  The  2-8-8-2  multi-layer 
perceptron  represent  the  optimal  network.  In  this  case  a  minimum  error  of  0.3  is 
obtained  after  1000  cycles. 

3  -  3D  ARTICULATED  ROBOT  ARM: 

3-1  -  Position  of  the  problem! 

In  this  section,  our  focus  is  the  study  of  an  idealized  3D  articulated  robot 
arm  by  using  the  kinematics  back-propagation  method  for  the  adjustment  of  the 
parameters.  It  consists  also  in  searching  for  optimal  architectures  in  this  case. 


Fig.  3:  Idealized  articulated  robot  arm. 

Figure  3  shows  the  geometry  of  the  idealized  3D-articulated  robot  arm  (RRR) 
with  three  degrees  of  freedom.  The  robot  arm  operates  in  the  space.  To  make  the 
arm  move,  the  desired  coordinates  of  the  end-effector  point  (x,  y,  z)  are  fed  to 
the  artificial  neural  network  so  that  it  generates  the  joint  angles  (Gj,  02,  63) 
necessary  to  control  the  motors  that  must  move  the  arms. 

3-2  -  Numerical  simulatinn; 

The  numerical  simulation  of  the  3D  articulated  robot  arm  aims  to  study 
the  application  of  neural  networks  to  the  entire  sphere  of  radius  8.  The  training 
has  been  performed  for  1,  =  1  (first  link),  I2  =  3  (second  link)  and  I3  =  4  (third 
link).  In  this  case,  an  important  number  of  data  for  learning  process  is  needed 
because  we  have  to  consider  the  entire  sphere  corresponding  to  the  workspace  of 
the  robot  arm.  So,  we  have  divided  the  workspace  into  five  spheres  of  respective 
radius  r  =  8,  7, 5, 4  and  2.  We  have  also  considered  different  points  verifying  the 
equations: 
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X  =  r  cos(0)cos((p) 
y  =  r  sin(0)cos(9) 

And  z  =  r  sin  (9) 

Where:  -  9  varies  from  -60°  to  +60°  with  a  step  of  20°. 

-  0  varies  from  0°  to  350°  with  a  step  of  10°  for  the  spheres  of  radius  8  and  7 

-  0  varies  from  0°  to  340°  with  a  step  of  20°  for  the  spheres  of  radius  5  and  4 

-  0  varies  from  0°  to  324°  with  a  step  of  36°  for  the  sphere  of  radius  2 


Also,  we  have  introduced  twelve  singular  points:  (0,0, -8);  (0,0,-7);  (0,0,-6); 
(0,0,-5);  (0,0,-4);  (0,0,-3);  (0,0,3);  (0,0,4);  (0,0,5);  (0,0,6);  (0,0,7);  (0,0,8). 


Different  networks  have  been  trained  for  the  research  of  optimal  architecture 
[11,  12]  using  the  kinematics  back-propagation  error  with  different  learning 
constants  and  different  momentum  constants.  The  results  obtained  in  the 
numerical  simulation  are  shown  in  figure  8.  The  learning  constant  t)  =  0.1  and 
the  momentum  constant  a  =  0.1  gives  the  best  convergence.  The  optimal 
network  obtained  is  the  (3-23-23-3)  multi-layer  perceptron  with  a  minimal  error 
of  0.9  obtained  after  750  cycles. 

4  -  RBF  NETWORKS  APPLIED  TO  2D  PLANAR  ARM; 

Radial  Basis  Fimction  (RBF)  networks  have  been  shown  to  fimction  very 
efficiently,  and  to  have  the  ability  to  model,  in  a  fairly  straight  forward  way,  any 
arbitrary  nonlinear  system  [13].  The  output  layer  of  such  a  network  is  merely  a 
linear  combination  of  the  hidden  layer  signals,  there  being  only  one  hidden 
layer.  RBF  networks  thus  offer  considerable  promise  in  terms  of  proving 
networks  stability  and  robustness,  mainly  because  of  the  way  in  which  the 
network  can  be  readily  defined  by  a  set  of  equations. 

Let  us  consider  an  input  vector  u.  An  element  of  the  output  vector  y  is 
determined  by  the  relation: 

k 

s(u)  =  XwiRi(u)  +  Wo 

i=l 

In  which  W;  (i  =  1,  2, . . .,  k)  are  the  network  weights,  Wq  is  a  bias  term  and  Rj  are 
activation  fimctions  of  the  form: 


Ri(u)  =  (D(||u-Ci||) 

Where  Cj  (i  =  1, 2,  ...,  k)  are  a  set  of  basis  fimction  centers  and  0(.)  is  the  radial 
basis  fimction  which  can  be  chosen  in  one  of  a  number  of  ways  [14].  In  our 
research,  we  have  chosen  a  Gaussian  form,  where: 

0(ri)  =  exp(-^) 


Neural  Networks  for  Robot  Control  -  Final  Report  -  Prof.  Chaiban  NASR 


Page  8/15 


In  which  cT;  is  a  scaling  parameter  and  rj  =||u-Ci||  represents  the  Euclidean 
distance  from  the  input  vector  u  to  the  center  location  Cj. 

In  their  most  usual  form,  as  shown  in  figure  4,  the  centers,  Cj,  are  variable  values 
and  fimction  learning,  as  for  the  MLP  case,  is  carried  out  by  adjustment  of  the 
weights  Wj . 


4-1  -  Determination  of  the  centers  and  of  the  scaling  parameters; 

As  was  the  case  with  MLP  networks,  a  structural  selection  problem  occurs 
with  RBF  networks,  this  time  being  dependent  on  how  the  centers  are  selected, 
both  in  terms  of  quantity  and  position.  The  choice  of  centers  will,  though, 
directly  affect  the  quality  of  the  fimction  approximation  obtained  from  the 
network,  with  a  bottom  line  requirement  that  the  number  of  centers  employed 
must  be  such  that  they  are  sufficiently  large  in  niunber  to  span  the  entire  input 
domain  [15].  However,  the  number  of  centers  required  tends  to  increase 
exponentially  with  regard  to  the  input  space  dimension  such  that  the  RBF 
approach  becomes  very  complicated  with  a  high  dimensional  space  [16]. 

.  A  direct  trade-off  exists  between  the  number  of  RBF  centers  selected  and 
the  quality  of  fimction  approximation  obtained.  For  a  low  dimensional/ 
complexity  problem,  only  a  small  number  of  centers  are  required.  However, 
where  the  problem  is  of  high  dimension,  a  reasonable  solution  can  be  provided 
by  the  selection  of  a  fixed,  fairly  low  number  of  centers,  it  being  realized  that 
the  approximation  provided  by  the  RBF  network  is  going  to  be,  at  least, 
reasonable.  Once  the  number  of  centers  has  been  selected  and  fixed,  it  then 
remains  for  the  actual  center  positions  to  be  chosen,  and  this  is  perhaps  better 
done  as  part  of  the  network’s  training  algorithm  with  the  resultant  values 
themselves  being  fixed.  Allowing  the  center  values  to  be  adjusted  on-line, 
although  opening  up  the  possibility  of  network  adaptability,  realizes  extremely 
difficult  problems,  firstly  with  regard  to  proof  of  network  stability,  and  secondly 
in  terms  of  algorithmically  controlling  the  nature  and  extent  of  center 
adaptation. 

The  main  choices  for  center  selection  were  taken  as  follows: 

1-  A  random  distribution  of  points  chosen  from  within  the  convex  hull  of  the 
input  data  set  to  be  the  RBF  centers. 

2-  A  random  input  vector  u  (t)  chosen  from  within  the  convex  hull  of  the  input 
data  set  is  presented  to  the  network. 
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3-  A  research  of  the  winning  neuron  which  represents  the  minimum  Euclidean 
distance  between  his  center  and  the  input  vector. 

4-  A  new  center  is  obtained  by  the  algorithm: 

Cg  (t  + 1)  =  Cg  (t)  +  Ti(u(t)  -  Cg  (t)) 

Where  Cg(t  +1)  is  the  new  winning  center,  Cg(t)  is  the  previous  winning  center 
and  u(t)  represents  the  input  data  vector  presented  to  the  RBF  network. 

5-  The  algorithm  is  stopped  when  the  Gaussian  centers  Cg„  are  well  distributed  in 
space  of  the  input  vectors,  which  means  by  minimizing  the  error  E,  given  by: 


Cgn  being  the  winning  center  related  to  the  input  vector  u 
The  scaling  parameters  are  determined  by  means  of  the  relation  [17]: 


For  a  given  K,  and  where  Cj  are  the  K-nearest  centers  to  the  winning  center 

Cg. 

The  number  of  neurons  K  is  chosen  by  simulation:  first,  we  choose  a  small 
value  of  K  and  then  we  increase  it  until  a  minimum  error  is  obtained. 

4-2  -  Results  of  simulations: 


We  have  used  the  algorithm  described  in  paragraph  4-1  with  step  learning 
Ti,  equal  to  0.01.  For  the  research  of  the  values  of  K  (number  of  nearest  centers 
to  the  winning  center),  we  have  considered  a  single  center  and  we  have 
increased  the  number  by  2  until  minimum  error  is  obtained  (figure  9).  The 
values  of  the  weights  corresponding  to  the  second  layer  of  the  RBF  network 
were  obtained  by  choosing  the  retro-propagation  method  with  momentum  t| 
equal  to  0.1  and  a  equal  to  0.1.  Different  networks  have  been  trained  for  the 
research  of  optimal  architecture  by  changing  the  number  of  the  hidden  neurons. 
The  optimal  network  obtained  is  a  (2-52-2)  that  converges  with  a  minimum 
error  of  0.39  obtained  after  300  cycles  (figure  10). 


5  -  COMMENTS  AND  GONCT  JJSTON: 

Applications  in  new  technologies  such  as  robotics,  manufacturing,  space 
technology,  and  medical  instrumentation,  as  well  as  those  in  older  technologies 
such  as  process  control  and  aircraft  control,  are  creating  a  wide  spectrum  of 
control  problems  in  which  non-linearity,  uncertainties,  and  complexity  play  a 
major  role.  For  the  solution  of  many  of  these  problems  techniques  based  on 
ANN  are  beginning  to  complement  conventional  control  techniques  [18],  and  in 
some  cases  they  are  emerging  as  the  only  viable  alternatives.  From  a  control 
theoretic  point  of  view,  ANN  may  be  considered  as  tractable  parameterized 
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families  of  nonlinear  maps.  As  such  they  have  found  wide  application  in  pattern 
recognition  problems,  which  require  nonlinear  decision  surfaces.  With  the 
introduction  of  dynamics  and  feedback,  the  scope  of  such  networks  as  identifiers 
and  controllers  in  nonlinear  dynamical  systems  has  increased  significantly. 

Our  works  aims  to  study  the  application  of  artificial  neiual  networks 
(multi-layer  perceptrons,  MLP)  for  2D  planar  robot  arm  and  3D  articulated 
robot  arm  by  using  the  kinematics  back-propagation  methods  for  the  adjustment 
of  parameters.  It  consists  also  in  searching  for  optimal  architectures  in  every 
case.  We  have  implemented  the  different  algorithms  and  we  have  obtained  the 
optimal  architecture  in  four  different  cases.  The  last  case,  concerning  the 
simulation  of  the  3D  articulated  robot-arm,  needs  an  important  number  of  data 
for  learning  process  because  we  have  to  consider  the  entire  sphere 
corresponding  to  the  workspace  of  the  robot  arm.  We  have  used  a  technical 
partition  of  the  workspace  of  the  robot  arm  to  build  the  necessary  data  for 
learning  process  in  supervised  neural  network  algorithms.  Good  results  where 
obtained  through  the  convergence  of  the  algorithms  used  in  the  research  of 
optimum  architecture  used  in  neural  networks. 

Another  target  was  the  study  of  the  application  of  artificial  neural 
networks  (Radial  Basis  Fimctions,  RBF)  for  the  2D  planar  robot  arm  by  using 
the  K-nearest  centers  algorithm  combined  with  the  back-propagation  methods 
for  the  adjustment  of  parameters.  It  consists  also  in  searching  for  optimal 
architectures.  The  results  of  the  simulation  show  a  very  important  influence  of 
the  choice  of  the  number  K,  which  represent  the  K-nearest  centers. 

Our  objectives  firom  this  work  are  the  real-time  process  (implementation 
on  DSP  of  different  architectures  in  neural  networks  and  studying  the  behavior 
of  all  the  process).  So,  now  we  are  working  on  the  research  of  optimal 
architectures  in  the  case  of  dynamical  robot  arms. 
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T-RESTIT.TS  OF  THE  SIMULATIONS 


EigsSi-Optimized  hidden  layers  in  the  case  of  an  arc  of  a  circle  of  radius  5 
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Fig.8:-Optimized  hidden  layers  in  the  case  of  a  sphere  of  radius  8 
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Fig.9:-C)ptimized  number  of  the  K-nearest  centers  to  the  winning  center. 


Fig.lOr-Optimized  hidden  layers  (RBF  network)  applied  to  a  2-D  planar  robot  arm. 


Neural  Networks  for  Robot  Control  -  Final  Report  -  Prof.  Chaiban  NASR 


Page  15/15 


